Are discoveries spurious? Distributions of maximum spurious correlations and their applications
Related resources
Guarding from Spurious Discoveries in High Dimension
Many data-mining and statistical machine learning algorithms have been developed to select a subset of covariates to associate with a response variable. Spurious discoveries can easily arise in high-dimensional data analysis due to enormous possibilities of such selections. How can we know statistically our discoveries better than those by chance? In this paper, we define a measure of goodness ...
Guarding against Spurious Discoveries in High Dimensions
Many data-mining and statistical machine learning algorithms have been developed to select a subset of covariates to associate with a response variable. Spurious discoveries can easily arise in high-dimensional data analysis due to enormous possibilities of such selections. How can we know statistically our discoveries better than those by chance? In this paper, we define a measure of goodness ...
Spurious Correlations in Mathematical Thinking
How does detection of correlational structure affect mathematical thinking and learning? When does correlational information lead to erroneous problem solving? Are experienced students susceptible to misleading correlations? This work attempts to answer these questions by examining a source of systematic errors termed the spurious-correlation effect. This effect is hypothesized to occur when a ...
Spurious Hyperleukocytosis
Hyperleukocytosis is an oncological emergency but is extremely rare in non-malignant conditions. Nucleated RBCs give rise to spuriously high total leucocyte count and cause clinical dilemma. Thalassemia major patients are known to have leucocytosis even after correction for nucleated RBCs. We report a case of undiagnosed Thalassemia major in a 4 month old infant with total leucocyte count highe...
Spurious correlations and inference in landscape genetics.
Reliable interpretation of landscape genetic analyses depends on statistical methods that have high power to identify the correct process driving gene flow while rejecting incorrect alternative hypotheses. Little is known about statistical power and inference in individual-based landscape genetics. Our objective was to evaluate the power of causal-modelling with partial Mantel tests in individu...
Journal
Journal title: The Annals of Statistics
Year: 2018
ISSN: 0090-5364
DOI: 10.1214/17-aos1575